Predicting the Category and Attributes of Mental Pictures Using Deep Gaze Pooling
نویسندگان
چکیده
Previous work focused on predicting visual search targets from human fixations but, in the real world, a specific target is often not known, e.g. when searching for a present for a friend. In this work we instead study the problem of predicting the mental picture, i.e. only an abstract idea instead of a specific target. This task is significantly more challenging given that mental pictures of the same target category can vary widely depending on personal biases, and given that characteristic target attributes can often not be verbalised explicitly. We instead propose to use gaze information as implicit information on users’ mental picture and present a novel gaze pooling layer to seamlessly integrate semantic and localized fixation information into a deep image representation. We show that we can robustly predict both the mental picture’s category as well as attributes on a novel dataset containing fixation data of 14 users searching for targets on a subset of the DeepFahion dataset. Our results have important implications for future search interfaces and suggest deep gaze pooling as a general-purpose approach for gaze-supported computer vision systems.
منابع مشابه
Early detection of MS in fMRI images using deep learning techniques
Introduction & Objective:MS is a disease of the central nervous system in which the body makes a defensive attack on its tissues. The disease can affect the brain and spinal cord, causing a wide range of potential symptoms, including balance, movement and vision problems. MRI and fMRI images are a very important tool in the diagnosis and treatment of MS. The aim of this study was to provide...
متن کاملPredicting of the Quality Attributes of Orange Fruit Using Hyperspectral Images
Background: Hyperspectral image analysis is a fast and non-destructive technique that is being used to measure quality attributes of food products. This research investigated the feasibility of predicting internal quality attributes, such as Total Soluble Solids (TSS), pH, Titratable Acidity (TA), and maturity index (TSS/TA); and external quality attributes such as color components (L*, a*, b*)...
متن کاملThe Female gaze in proportion to pictorial elements in "A parrot with fruit and a portrait of a girl"
Qajar painting influenced Iran’s painting with a new kind of illustrations originating from the past traditions. Art and cultural politics of Fath Ali Shah performs an obvious role amongst the influential agents and historical events in the era of Qajar paintings for a presentation of the concepts of power in the social, political and cultural arena. Fath Ali Shah’s patronage of the art alters ...
متن کاملBag of Attributes for Video Event Retrieval
In this paper, we present the Bag-of-Attributes (BoA) model for video representation aiming at video event retrieval. The BoA model is based on a semantic feature space for representing videos, resulting in high-level video feature vectors. For creating a semantic space, i.e., the attribute space, we can train a classifier using a labeled image dataset, obtaining a classification model that can...
متن کاملPredicting the survival and dropout of addiction treatment interventions based on sensation seeking and impulsivity
The present study was conducted with the aim of predicting the survival and dropout of addiction treatment based on sensation seeking and impulsivity. The present study was descriptive-correlational. The statistical population of this study was all addicts in Ardabil city that came to one of the centers for addiction treatment and 349 of them were selected based on Krejcy and Morgan tables and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1611.10162 شماره
صفحات -
تاریخ انتشار 2016